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Abstract. We study a new random matrix ensemble X which is constructed by an application of a two 
dimensional linear filter to a matrix of iid random variables with infinite fourth moments. Our result 
gives asymptotic lower and upper bounds for the spectral norm of the (centered) sample covariance 
matrix XX 1 when the number of columns as well es the number of rows of X tend to infinity. 



1. Introduction and main result 

A random matrix ensemble is a sequence of matrices with increasing dimensions and randomly 
distributed entries. Random Matrix Theory (RMT) studies the asymptotic spectrum, e.g., limiting 
eigenvalues and eigenvectors, of random matrix ensembles. A comprehensive introduction into RMT 
can be found, for instance, in the textbooks [2] and [4]. In Davis et al. [8] the authors study the asymp- 
totic properties of the extreme singular values of a heavy-tailed random matrix X the rows of which 
are given by independent copies of some linear process. This was motivated by the statistical analysis 
of observations of a high-dimensional linear process with independent components. Typically, the 
linear processes used in multivariate stochastic modeling have the more general form 



X t =YA^Z H , t=\,. 



j 



,n, 



where A ti> is a sequence of deterministic p x p matrices and Z t is a noise vector containing p inde- 
pendent and identically distributed (iid) random variables Z\ u . . . ,Z pt . Of course, the process X does 
not have independent components except when A (■>> is a multiple of the identity matrix. Let us denote 
by X the matrix with columns Xi, . . . , X n . Then the it-th entry of X is given by 

p ^ 

= Zj A ik ^-i- 

j k=\ 

This motivates to study the general random matrix ensemble 

j k 

with some iid array Z = (Z/ f ) and some function : N X Z 2 -> R, (i, j, k) i-» d(i, j, k) such that the 
above double sum converges. The matrix X can be seen as a two dimensional filter applied to some 
noise matrix Z. The spectral distribution of these matrices has been studied for Gaussian matrices 
X and d(i,j,k) = d(j,k) by [9], and for more general light-tailed distributions by [1] under the 
assumption that d(j,k) = if j or k is larger than some fixed constant. We investigate the case where 
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the function d can be factorized in the form d{i, j, k) = cfik- Thus in our model the random matrix 
X = (X it ) e W xn is given by 



(1-1) X it = 2, Zj c fik Z i-k,t-h 

j k 

for two real sequences (c ; ) and (%). In contrast to the model X = (X,- f ) considered in Davis et al. 
[8], with 



it — 2_j c j Z iJ-j' 



X 



the matrix X has not only dependent columns but also dependent rows. Indeed, writing the model 

(1.1) in the form 

(1.2) X; ; Y^nti, , 

j 

(1-3) fo = 2^-*.*- 

k 

one can see that, by going from X to X, the noise sequence Z in the processes along the rows is 
replaced by a linear process £ along the columns. Since we want to investigate a heavy-tailed random 
matrix model we assume that (Z^)^ is an array of regularly varying iid random variables with tail 
index a e (0, 4) satisfying 

(1.4) nP(\Z n \ >a n x) -+x~ a . 
Furthermore, let {cj) and (6%) be sequences of real numbers such that 

(1.5) Y l c / < °°' and 

j 

(1.6) V \9kf < oo for some 6 < min{a, 1}. 

k 

If 5/3 <o'<4we also require that Z\\ satisfies the tail balancing condition, i.e., the existence of the 
limits 

P(Z n > x) P(Z n < -x) 

(1.7) lim — ^ L = q and lim — — f = 1 -q 

™* P(\Zn\ > x) H ^P(|Z u |>x) 

for some < q < 1. By the above definitions, X is a p x n random matrix with dependent entries 
with infinite fourth moments. Under the assumption that p and n go to infinity such that the ratio 
pin converges to a positive finite constant, Soshnikov [11, 12] and Auffinger et al. [3] have studied 
the eigenvalues of heavy-tailed random matrices with independent and identically distributed entries. 
Bose et al.[7] investigate the spectral norm of circulant type matrices with heavy-tailed entries. In the 
following we assume that both p — p n and n go to infinity such that 

(1.8) lim sup -| < oo 
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for some ft > satisfying 



yg < oo 
j6 < max 

j6 < max 



if a 6 (0, 1] 
2-a f 



a-1'2 



- a 



4(a- i; 



if ore (1,2), 



if 2 < a < 3, 



or 



if 3 < a < 4. 



(1.9) 



and// 



2 and EZ 2 n 



3a -4 

Recall that any symmetric matrix A has real eigenvalues. The spectral norm ||A|| 2 of A is given by 
the maximum of the absolute values of the eigenvalues of A. For X given by (1.1), our main theorem 
investigates the asymptotic behaviour of the spectral norm ||S|| 2 of the centered sample covariance 
matrix S = XX T — n/ux,aHH J , where 



E (z 2 n l [Z 2 n<<] ) XjC* 
{ E(z\ l )Y jj c) 
(Hjj) € W x3 p is given by 

(1-10) Hij = 8p-(j-i)l{0^j-K2p}- 

Observe that the diagonal entries of nfix,aHH T are exactly the means of the diagonal elements of XX J 
if the observations have a finite variance. In case the observations have an infinite variance, we do not 
have to center, except when a = 2 and EZ^ = oo, where we use a truncated version of the mean. In 
the latter case jix,a also depends on p and n. 



for < a < 2, 
for a 
else, 



Theorem 1. Consider the random matrix model given by equations (1.1), (1.4), (1.5) and (1.6) with 
a € (0, 4). If a e (5/3, 4) we assume that Z\\ has zero mean and satisfies the tail balancing condition 
(1.7). Denote by S = XX J — npx,aHH J the centered sample covariance matrix, with px,a and H = 
(Hij) € RP X ^P as given in (1.9) and (1.10). Let T\ be an exponentially distributed random variable 
with mean one and x > 0. If p and n go to infinity such that condition (1.8) is satisfied then we have 
for the spectral norm \\S || 2 of S that 

I \ 

r: 2/Q; max^ V c 2 > x ^liminf p(||5|| 2 > a l D x) 

j ! 

<limsupP(||S|| 2 > al p xj 

"—>oo 

T- ] 2,a m^\e l \Y J \0k\Y J c ) >x 



(1.11) 



* j 

Remark 2. (i) If all 6k 's except one are zero, one has equality and therefore recovers the result 



from [8, Theorem 1 ]. If two or more 6k are non-zero, then 

\ 



r; 2/a max^^ c 2 > x 



l 



< P 



r- 2/ff max|^|^|^|^c5>x . 

k j 

Whether the lim inf and lim sup are equal in this case and attain one of its boundaries remain 
open problems. 
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(ii) Since P(T 2 ^ a < x) = e x "'\ inequality (1.11) can equivalently be written as 



r -«/2 . 



exp 



-x~ a/2 max\8i\ a/2 



\ 

2 
j 

J J 



a/2\ 



;liminfW||S|| 2 < a* x) 
;limsupP(||S|| 2 <a 2 np x) 



: exp 



x~ a/2 max \9 k f 



2>5 



a/2\ 



\ J ) ) 



Results from the theory of point processes and regular variation are required through most of this 
paper. A detailed account on both topics can be found in a number of texts. We mainly adopt the 
setting, including notation and terminology, of Resnick [10]. 



2. Dependence of successive rows 

To understand the basic principle of our method it is beneficial to first investigate the case where 
only successive rows of X are dependent and where a e (0, 2). Since nx,a = for a < 2, S = XX T 
and therefore the spectral norm of S is equal to the largest eigenvalue of XX T , i.e., ||5||2 = A max . We 
start with the model 

(2-1) X U /• 

i 

(2.2) & = Z it + 0Z ( _ U . 

It is easy to see that X, f = Xj t + 6*X,_i if , where Xj t = J^j CjZij-j for i = 0, 1, . . . ,p, and t = 1, . . . , n. 
To proceed further we define the matrices X = (X it ) € W xn , X = (X(;_i), t ) e R^ +1 ) x " and H = 
(Hjj) € R^ x (p +1 ), where all entries of H are zero except Hu = 9 and //,-,;+ 1 = 1. Then we clearly 
have the matrix equality 

(2.3) X = HX. 

Moreover, we denote by D = (A) = diag(XX T ) £ r(p +1 ) x (p +1 ) the diagonal of XX 1 , that is the 
diagonal matrix which consists of the diagonal entries of XX J . For the convenience of the reader, we 
restate the result from [8, Proposition 3.4]. 

Proposition 3. Under the conditions of Theorem 1 we have that 

a ; 2 \\xx J - d|L -A o. 

f ii iiz n—)oo 

Thus, since \\H\\ 2 < ||#]L < 1 + \9\, we immediately conclude, by (2.3), that 

(2.4) a' 2 \\XX J - HDH J \\ 2 < \\H\\ 2 2 a' 2 \\XX J - D\\ 2 0. 



Hence, by Weyl's inequality ([5, Corollary III.2.6]), the largest eigenvalue A max of the sample covari- 
ance matrix XX J based on the observations X is asymptotically equal to the largest eigenvalue of the 
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tridiagonal matrix 



(Di + 9 2 D 2 9D 2 



(2.5) 



HDH 







0D 2 D 2 + 9 2 D 3 8D 3 








+ e 2 D p 

6D n 





d p + g 2 d p+1 



e R pxp . 



It is our goal to find an asymptotic upper and lower bound for /l max . First we prove a lower bound. 
Clearly, A max is asymptotically larger or equal than the largest diagonal entry of HDH 1 , i.e., 



(2.6) 



A max > max (A- + 6 2 D i+ y) + o P (a 2 np ), 



where op(l) denotes some generic random variable that converges to zero in probability as n goes to 
infinity. Since D i+ i = E" = i X 2 we have to find the maximum of an MA(1) process of partial sums 
of linear processes. By [8, Proposition 3.5] we already know that 

p p 

(2.7) 



!=0 !=0 (=1 

Since (D,) is an iid sequence, this result can be generalized as follows. 
Lemma 4. Under the conditions of Theorem 1 we have that 

p CO 

l P = Tj % 2 (A- + i,A) 1 = Yi (V" 2^(1.0) + V"S^(0,1) • 
1=1 1=1 

Proof. By the continuous mapping theorem applied to (2.7), we immediately conclude that 

p 



7 P - Z ( e ^ 2 (A+i,0) + e v 2 (0,D,)) ^ 7 " 
i=l 



Thus, we only have to show that \I p (f) - I* p (f)\ — > in probability for any continuous function with 
supp(/) c {x = (x\,x 2 ) £ 1R 2 : max{|^i|, |jc2|} > <5}- To this end, let L = {x : mm{\x\\, |jc2|} < 5} and 
observe that, by independence of (Di), 

El p (L c ) < pP(|D i+1 | > a*,*, |D ; | > a*,*) = 0(6~ V 1 ) -> 0. 

Thus /,(/) = J^/rf/p + o P (l) and, by definition of I*, I*(f) = j L fdI*. Since f(z) = if 
max{|xi|, fell < 5, it suffices to show that 



i=l 

p 

+ J] /( a «p( D '-+i, A-))l {fl -2|A +1 |<tf}ni fl ^| D ,.|s, 5 , -/«p(0,D;))l |fl -2 |A| ^, 



0. 



/=! 
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We only treat term A, as B can be handled essentially the same way. To this end, observe that 

p 

A <3/fcp(A'+l.A)) - /(a~*(A+l,0))| l { a;f\D i+l \>5}nfa 2 P \Di\<6} 



/=i 

p 



+ Yj l/( a «p( D ''+l'°))l 1 I^IA + ,l^ln{^IAI^l =I + U- 
i=l 

Clearly, by independence, 

E(II) < sup f(x)pP(a- n 2 p \D i+l \ > 6)P(a n 2 \D i \ > 6) = 0(p' 1 ) -» 0. 
Furthermore, we have, for any < r\ < 6, that 

l {a^\D i+l \>6\n{a^,\D,\<6\ < 1 {a;||D; +1 |><5}n{a;||A|<'7} + H^Di+^ni^Dfer,}- 

Thus, for some c > 0, 

E(I) <sup{|/(xi,X2) -/(*i,0)| : |xi | > <5,|x 2 | < ^(lA+ll > a 2 p 7/) 

+ cpP(\D i+1 \ > a 2 np rj)P(\Di\ > a 2 ^). 

Obviously, the second summand converges, for fixed rj > 0, to zero as n — > oo. The first summand can 
be made arbitrarily small by choosing rj small enough, since / is uniformly continuous. □ 



The continuous mapping theorem applied to Lemma 4 gives 

p oo 

1=1 (=1 
Therefore, by (2.6), the asymptotic lower bound of A max is given by 



(2.8) a 2 max (D f + 2 D m ) A max{l, 2 }i7 2/£> V c 2 

Regarding the upper bound, we make use of the fact that ||//D// T || 2 < ||//D// T || oo . Observe that 
WHDH 1 ^ = max (l m} \6\Di + D t + G 2 D i+i + |0|D i+1 l {fetp} ) 

= max ((1 + |0|%i } )D; + (lei^pj + &)D i+ A . 

So once again we have to determine the maximum of an MA(1) of partial sums of linear processes. 
An application of Lemma 4 yields that 

(2.9) a; 2 WHDH 1 ^ A max{l + \8\, \6\ + fl 2 }^ 2 ' £ 4 

j 

The lower and upper bound (2.8) and (2.9) together with equation (2.4) finally yield that 

/ ^\ 

P 

j ) 

< lim sup P Umax > a 2 np x) 

n— >oo 



max{ 1 , 6 2 }T~ 2/a Y c2 J >x < H ™ S f P t 71 " 13 " > a "P x ) 



^P 



(|0| + max{l,0 2 })r- 2/ ^c 2 >^ 
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Clearly, this result is a special case of Theorem 1 when the process ^ t is a moving average process of 
order one. 

3. Proof of the theorem 

In this section we will proof Theorem 1 in its full generality. We start with the case where a < 2. 
To this end we define an approximation 

x( P ) 

of X and so that 



2 n^oo 



2 n— »oo 



(3.1) (i) af p £M(lM) T -HDH 1 

(3.2) (ii) a~l XX T -X^(X^) J 

(iii) and finally we derive upper and lower bounds for ||//D// T || 2 . 

Note that, for notational convenience, we will assume that % = for k < 0, since the extension of the 
proof to the case where the dependence in (1.3) is two-sided is analogous. 

(i). First we define the approximation £W = (xj f p) ) € RP xn by xj f p) = Zf = o where 
Xit = >ZjCjZ u .j. Furthermore we define X = {X Hht ) e R 2 P Xn , and H = (H u ) e M px2 P by 

(3 3) H ..-{ 6 P-{H) i£0<j-i<p, 

^ M 'J~\ o else. 

Then we have that HX = X^\ Indeed, 

2p i+p p p 

{HX) it = ^ HuXi-pj = ^ HuXi- p j = ^ Hjj + iX i+ i- p j = ^ 
/=o l=i /=o /=o 

p 



/ \ OkXi-kj 



p-(p) 



X 



k=0 

Thus, if we let D = (D,) = diag(XX T ) e W? px2p , then we obtain (3.1) by virtue of Proposition 3 and 

ii^ii2<n^iioo<i:, Loi^i<~- 

(ii). In order to proceed we will require the following lemma. 

Lemma 5. Under the conditions of Theorem 1 we have, for < a < 2, that 



p CO oo 

Z e "n 2 P zr=o ft EJL, xh,, Z Z e rr 2/ > Z, c) ■ 



i=l i=l k=Q 

Proof. A straight-forward generalization of Lemma 4 yields, for any m < oo, that 

p m oo 

(3 " 4) Z e ^ 2 Z? =1 (%,r---*L,, ( ) Z Z Vl;^' 

!=1 fc=0(=l 

where denotes the k-th unit vector of K°°, i.e, the k-th component of eu is one and all others are 
zero. By an application of the continuous mapping theorem we obtain the claim for a finite order 
moving average of the partial sums (Z"=i ^|)i> ie > 



p oo m 

Z e rt Z? =0 ft ZU x h, Z Z e Tf /a e k z, c) > 



7=1 i'=lfc=0 
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On the other hand we have, for m — > oo, that 

oo m 



Z Z e rr 2/ "4 Z 7 c) Z Z e rr 2/ '% £, • 



i=l &=0 (=1 k=0 

To finish the proof of the lemma it is, by [6, Theorem 3.2], therefore only left so show that 

( p p \ 



lim lim sup p 



0, 



Z e "nj> Z*=o e * 2"= i x f- k , ' Z e "n} 2™=o e * 2" = l X? ^ ( 
\i=l i=l J 

where p denotes a metric of the vague topology on the space of point processes. To this end, observe 
that 



m n 



oo n 



Z 6k Z x ht ~ Z 6k Z x ht < Z m Z x hr 

k=0 f=l &=0 r=l k>m t=\ 

Therefore, by the arguments of the proof of [8, Proposition 3.5], we only have to show, for any y > 0, 
that 

lim limsupP((A^) c ) = 0, 



where 



Observe that 



A y n = | max £ J]*^ <a 2 np 



y ■ 



l>m t= 1 



P((Al) c ) < P P 



\I>m f=l 



< pP 



Z^Z c 5Z z ^><i 

l>m j t= 1 



(3.5) 
We have 



+ pP 



Z 10/1 Z Z lc J Ckl Z M,H z i,r-k\ > a\ p y 

I>m j k>j f=l 



I + 11. 



( \ 



a/2 ( 



\a/2 



lim lim sup / = lim 



■a/2 







2>l 2^c? 

U>H1 J \ j 

by a slight modification of the proof of [8, Lemma 3.1]. In fact, one can also map the array (Z,- f ) to a 
sequence and then apply [8, Lemma 3.1] directly. Regarding the second term, note that 



// <pP 
+pP 

As before we conclude that 



Z^ZZ |c ^iZ4-; >a : 

I>m j k>j f=l 

t n 

Z^ZZ^'Z^^V 

l>m j k>j f=l 



iii + n 2 . 



( \ 



z ^ z z ^ 

\l>m J \ j k>j I 

and clearly term IL can be handled similarly. 



lim lim sup IT = hm 

m— >oo n—too m—>oo 



a/2 ( 



a/2 



-a/2 



0, 
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U'k,k'eZ 2 x{K \{0,i,...,p}) 2 f=l 

Therefore we have the bound 

XX T -X^(X^) T < XX 1 -X^(X^) T \\ 



2 



p 

= ™*X1 ^ \cici'8 k eA2_j\^i-k,t-i2j-wj-v 



7=1 /,/',/t,/t'eZ 2 x(]No\{0,l,...,/)|) 2 t=\ 

Observe that the product \Z^k,t-iZj-k' ,t-r\ has tail index a/2 if and only if j — k' = i -k and / = /'. In 
this case we can treat this term like the first term in I in (3.5) and obtain 



l</<£> — i — i n—>oo 
l,k,k'eZx{p+l,p+2,...\ 2 t=l 

since zZk>p \@k\ - > 0. If the product \Zi^,t-iZj~k' ,t-l'\ does not have tail index a/2, i.e., j — k' ± i - k' 
ovl + I' , then the product has only tail index a and can then be treated similarly as the second term II 
in (3.5). 

(iii). By a combination of (i) and (ii) we have that 

all \\XX J - HDH J \l -A 0. 

I II llz fl-^oO 

Thus, by Weyl's inequality, the difference of the largest eigenvalues of XX J and HDH J converges to 
zero. As in the previous section, the final step is to find lower and upper bounds on ||//D// T || 2 . By 
definition of H, we have 

rain{i,j]+p 

(HDH T )ij = £ p _(M)V(W)A. 

l=max{i,j) 

Hence HDH J is no longer a tridiagonal matrix. Recall that the entries of the diagonal matrix D are 
given by D, = £" =1 Xf r By virtue of Lemma 5 an asymptotic lower bound is given by 



a 1 



, H HDH T L >a~j max(HDH T ) u 

a~l max + . . . + 0^ p ) lf a max % £ c). 
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Regarding the upper bound, observe that 

p 

\\hDH j \1 < ||//D// T || = max Y \(HDH T ) 

II 112 II lloo 1<f< > 



Z IV(/-0V(/-;)l D ' 

7=1 /=max{;',;'| 
2p p 

= max D ' Z 1 e-p<M'<'<'+p}l 6 'p-(/-/) 6, /'-(/-y)l 

^ /=1 y=l 

i+p 1 

= ^Z D/I V(M)I Z \°P-(H)\ 

l=i j=l-p 

p p 
= max V D i+l \9 p -i\Y ^ 

7 /=0 yt=0 

so we have to determine the maximum of a moving average of order p of (A), with coefficients 

i0p-zi zLo i^i- B y Lemma 5 > 

p p CO 

(3.6) <„ 2 max V Df+zlfy-zl Y |0*l T[ 2/ff max |0,| V \0 k \ V c 2 

' /=() i=0 fc=0 7 

This completes the proof of Theorem 1 for a < 2. 

Proof of Theorem 1 for a > 2. Since we now consider the spectral norm of XX J - npx,aHH T , one has 
to replace D by the centered diagonal matrix D = D - nfix, a Ip, ie, 

n 

t=\ 

Then one has, with the same truncation as before, that 

(tip) (x(p)) t - W x, a HH J ) - HDH T =a~l \\H(XX J - n^ a I p )H J - H(D - Wx J p )H J \\ 



a„p 



2 n P 

<\\H\$a-l\\XX J -Dt -A 0, 



np ||"" "112 _ 

by an application of Proposition 3. Then one shows, similarly as in Lemma 4, that for each m < oo, 

p oo m 

Z e ^\rU^U^l k ,r^, a )\ ^Z Z Z e r: 2/ ^E, C }- 

£=1 i=l jfc=0 

The extension to the case where ni = oo follows analogously to the proof of [8, Proposition 3.5 (case 
2 < a < 4)]. This establishes Lemma 5 for 2 < a < 4, i.e. 

/? CO CO 

(3 " 7) Z e ^l2r=o*2'/ = ,(^ f -^)l ^Z Z Z V"*^' 

£=1 i'=l i=0 

Then one shows (i)-(iii) with D replaced by D by a straightforward combination of (3.7) and the 
approach used in the proof of Theorem 1 for < a < 2. □ 
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